Phishing URL Detection: A Real-Case Scenario Through Login URLs

نویسندگان

چکیده

Phishing is a social engineering cyberattack where criminals deceive users to obtain their credentials through login form that submits the data malicious server. In this paper, we compare machine learning and deep techniques present method capable of detecting phishing websites URL analysis. most current state-of-the-art solutions dealing with detection, legitimate class made up homepages without including forms. On contrary, use URLs from page in both classes because consider it much more representative real case scenario demonstrate existing high false-positive rate when tested pages. Additionally, datasets different years show how models decrease accuracy over time by training base model old testing recent URLs. Also, perform frequency analysis domains identify carried out phishers campaigns. To prove these statements, have created new dataset named Index Login (PILU-90K), which composed 60K URLs, index websites, 30K Finally, Logistic Regression which, combined Term Frequency - Inverse Document (TF-IDF) feature extraction, obtains 96.50% on introduced dataset.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

LEARNING TO DETECT PHISHING URLs

Phishing attacks have been on the rise and performing certain actions such as mouse hovering, clicking, etc. on malicious URLs may cause unsuspecting Internet users to fall victims of identity theft or other scams. In this paper, we study the anatomy of phishing URLs that are created with the specific intent of impersonating a trusted third party to trick users into divulging personal data. Unl...

متن کامل

A Review on Phishing URL Detection using Machine Learning Systems

Seeking sensitive user data in the form of online banking user-id and passwords or credit card information, which may then be used by ‘phishers’ for their own personal gain is the primary objective of the phishing e-mails. With the increase in the online trading activities, there has been a phenomenal increase in the phishing scams which have now started achieving monstrous proportions. This pa...

متن کامل

Lightweight Phishing URLs Detection Using N-gram Features

Phishing is a kind of attack that belongs to social engineering and this attack seeks to trick people in order to let them reveal their confidential information. Several methods are introduced to detect phishing websites by using different types of features. Unfortunately, these techniques implemented for specific attack vector such as detecting phishing emails which make implementing wide scop...

متن کامل

Security Evaluation of Pattern Classifier against Phishing URL Detection

— Pattern classification is a branch of machine learning that focuses on recognition of patterns and regularities in data. In adversarial applications like biometric authentication, spam filtering, network intrusion detection the pattern classification systems are used. Extending pattern classification theory and design methods to adversarial environment is thus a novel and very relevant resear...

متن کامل

What’s in a URL? Genre Classification from URLs

The importance of URLs in the representation of a document cannot be overstated. Shorthand mnemonics such as “wiki” or “blog” are often embedded in a URL to convey its functional purpose or genre. Other mnemonics have evolved from use (e.g., a Wordpress particle is strongly suggestive of blogs). Can we leverage from this predictive power to induce the genre of a document from the representation...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2022

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2022.3168681